SQUARING RECTANGLES FOR DUMBBELLS 



J. W. CANNON, W. J. FLOYD, AND W. R. PARRY 

Abstract. The theorem on squaring a rectangle (see Schramm 6 and Cannon- 
Floyd-Parry [T|) gives a combinatorial version of the Riemann mapping the- 
orem. We elucidate by example (the dumbbell) some of the limitations of 
rectangle-squaring as an approximation to the classical Riemamnn mapping. 



1. Introduction 

A quadrilateral is a planar disk D with four distinguished boundary points a, 
b, c, and d that appear in clockwise order on the boundary. These four points 
determine a top edge ab, a right edge be, a bottom edge cd, and a left edge da. A 
tiling of D is a finite collection of disks Tj, called tiles, whose union fills D, whose 
interiors Int Tj are disjoint, and whose boundaries form a finite graph T in D that 
contains dD. 

The rectangle-squaring theorem (see Schramm [6] and Cannon-Floyd-Parry pQ) 
states that there are integers pi > parametrized by the tiles (for a tile Tj, pi is 
called the weight of Tj), not all equal to 0, unique up to scaling, such that squares 
Si of edge length pi can be assembled with essentially the same adjacencies as the 
tiles Tj to form a geometric rectangle with top, bottom, and sides corresponding 
to the edges of D. Some combinatorial distortions are inevitable. For example, 
in a squared rectangle, at most four tiles can meet at a point. The distortions 
allowed are these: (1) a vertex may expand into a vertical segment and (2) a tile 
may collapse to a point. No other distortions are required. 

The rectangle-squaring theorem is essentially a combinatorial version of the Rie- 
mann Mapping Theorem. It has the advantage over other versions of the Riemann 
Mapping Theorem that the integers pi can be calculated by a terminating algorithm 
and can be approximated rapidly by various simple procedures. As a consequence, 
rectangle-squaring can be used as a rapid preprocessor for other computational 
methods of approximating the Riemann mapping. 

Schramm 6 has pointed out that tilings given by the simplest subdivision rules 
give squarings that need not converge to the classical Riemann mapping. The 
purpose of this paper is to further elucidate the limitations of the method. In 
the classical Riemann mapping, changing the domain of the mapping anywhere 
typically changes the mapping everywhere. Our main result shows that this is not 
true for rectangle squaring. A dumbbell is a planar quadrilateral, constructed from 
squares of equal size, that consists of two blobs (the left ball and the right ball) 
at the end joined by a relatively narrow bar of uniform height in the middle. We 
show that for any choices of the left and right balls of a dumbbell, the weights pi 
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associated with the squares in the middle of the bar are constant provided that the 
bar is sufficiently long and narrow. In particular, subdivision cannot lead to tilings 
whose squarings converge to the classical Riemann mapping. 

In order to state our main theorem, we precisely define what we mean by a 
dumbbell. A dumbbell D is a special sort of conformal quadrilateral which is a 
subcomplex of the square tiling of the plane. It consists of a left ball, a right ball, 
and a bar. The left ball, the bar, and the right ball are all subcomplexes of D. 
The bar is a rectangle at least six times as wide as high. It meets the left ball in 
a connected subcomplex of the left side of the bar, and it meets the right ball in a 
connected subcomplex of the right side of the bar. The bar of a dumbbell is never 
empty, but we allow the balls to be empty. The bar height of D is the number of 
squares in each column of squares of the bar of D. Figure [T] shows a dumbbell with 
bar height 1. 




Figure 1. A dumbbell D 

If D is a dumbbell with bar height n, then a weight function p for D is virtually 
bar uniform if p(t) = \H p for any tile t in the bar of D whose skinny path distance 
to the balls of D is at least 3n. (The skinny path distance between two subsets of 
D is one less than the minimum length of a chain of tiles which joins a point in one 
subset to a point in the other subset. The skinny path distance is used extensively 
in [3].) We are now ready to state our main theorem. 

Dumbbell Theorem. Every fat flow optimal weight function for a dumbbell is 
virtually bar uniform. 

As indicated above, the theorem has consequences for Riemann mappings. Let 
D be a dumbbell with bar height n, and let p be a fat flow optimal weight function 
for D. If t is a tile in the bar of D whose skinny path distance to the balls of D is at 
least 3n, then the dumbbell theorem implies that p{t) = ~H p . Now we subdivide D 
using the binary square finite subdivision rule 1Z, which subdivides each square into 
four subsquares. We see that 1Z(D) is a dumbbell with bar height 2n. Let s be a tile 
of 1Z(D) contained in t. Then the skinny path distance from s to the balls of 1Z(D) 
is at least 6n. If a is a fat flow optimal weight function for 1Z(D), then the dumbbell 
theorem implies that a(s) — ^H a . It follows that as we repeatedly subdivide D 
using the binary square finite subdivision rule and normalize the optimal weight 
functions so that they have the same height, if they converge, then they converge 
to the weight function of an affine function in the middle of the bar of D. The only 
way that a Riemann mapping of D can be affine in the middle of the bar of D is 
for D to be just a rectangle. Thus our sequence of optimal weight functions almost 
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never converges to the weight function of a Riemann mapping of D. We formally 
state this result as a corollary to the dumbbell theorem. 

Corollary. When we repeatedly subdivide a dumbbell using the binary square 
subdivision rule, if the resulting sequence of fat flow optimal weight functions con- 
verges to the weight function of a Riemann mapping, then the dumbbell is just a 
rectangle. 

Here is a brief outline of the proof of the dumbbell theorem. Let I? be a dumbbell, 
and let p be a fat flow optimal weight function for D. From p we construct a new 
weight function a for D. The weight function a is constructed so that it is virtually 
bar uniform and if t is a tile of D not in the bar of D, then a(t) — p(t). Much effort 
shows that H a = H p . Because a is a weight function for D with H a = H p and p 
is optimal, A p < A a . Because a agrees with p outside the bar of D, it follows that 
there exists a column of squares in the bar of D whose p-area is at most its c-area. 
The main difficulty in the proof lies in controlling the er-areas of such columns of 
squares. The key result in this regard is Theorem 15. 71 Most of this proof is devoted 
to proving Theorem 15.71 It gives us enough information about the cr-areas of such 
columns of squares to conclude that H a = H p and eventually that a = p. Thus p 
is virtually bar uniform. 

We give some simple examples in Section [2] Sections [3] and 0] are relatively easy. 
The skinny cut function <P is defined in the first paragraph of Section O Once 
the reader understands the definition of •P, the statement of Theorem 15.71 can be 
understood. After understanding the statement of Theorem 15.71 the reader can 
read Section [6] to get a better grasp of the proof of the dumbbell theorem outlined 
in the previous paragraph. Theorem l5.7l is the key ingredient, and its proof presents 
the greatest difficulties in our argument. 

In Section [7] we discuss without proofs an assortment of results which are related 
to (but not used in) our proof of the dumbbell theorem. 

2. Examples 

We give some simple examples here to illustrate the theorem. 

Example 2.1. We begin with an example that motivated this work. Consider 
the topological disk D\ shown in Figure [H The left ball of D\ is a union of two 
tiles, the bar of D\ is a single tile, and the right ball of D\ is empty. D\ is not a 
dumbbell because the bar isn't wide enough, but since it has so few tiles it is easier 
of analyze. Figure [3] shows the first three subdivisions of D\ with respect to the 
binary square subdivision rule; the tiling lZ n+1 (Di) is obtained from TZ n (Di) by 
subdividing each square into four subsquares. We consider each subdivision as a 
conformal quadrilateral by choosing the same four points as vertices and the same 
labeling of the edges. 

We considered this example because we were interested in the squared rectan- 
gles corresponding to the optimal weight functions for the fat flow moduli of this 
sequence of quadrilaterals. Given a tiling T (or, more generally, a shingling) of 
a conformal quadrilateral Q, a weight function on T is a non-negative real- valued 
function p on the set of tiles of T. If p is a weight function on T and t is a tile of T, 
then p(t) is the weight of t. One can use a weight function to assign "lengths" to 
paths in Q and "areas" to subsets of Q. The p-length of a path a in Q is the sum 
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FIGURE 2. The quadrilateral Di 



FIGURE 3. The first three subdivisions of Di 



of the weights of the tiles that the image of a intersects, and the p-area A p (W) of 
a subset W of Q is the sum of the squares of the weights of the tiles that intersect 
W . One then defines the p-height H p of Q to be the minimum p-length of a path 
in Q that joins the top and bottom of Q, and one defines the p-area A p of Q to 
be A p {Q). The fat flow modulus of Q with respect to p is H^/A p . The optimal 
weight function is a weight function whose fat flow modulus is the supremum of 
the fat flow moduli of weight functions; it exists and is unique up to scaling. There 
is a squared rectangle corresponding to Q whose squares correspond to the tiles of 
T with non-zero weights under the optimal weight function; furthermore, the side 
length of one of these squares is the weight of the corresponding tile. 

Figure [4] gives optimal weight functions for the first two subdivisions of the 
quadrilateral D\ , and the left side of Figure [5] gives the squared rectangles for 
the first three subdivisions of D\. (See [1] for detailed information about optimal 
weight functions for tilings.) For each of these tilings of D\, one can define a circle 
packing whose carrier complex is obtained from the tiling by adding a barycenter 
to each tile and then subdividing each tile into triangles by adding an edge from 
each vertex of the tile to the barycenter. By only drawing the edges of the packed 
carrier complex that correspond to edges of the tiling, one can use the circle packing 
to draw the tilings. On the right side of Figure this is done for the first three 
subdivisions of D\ using Stephenson's program CirclePack [7]. 
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Figure 4. Optimal weight functions for the first two subdivisions 
of D 1 




While one cannot expect discrete functions from squared rectangles for tilings to 
converge to Riemann maps, He and Schramm showed in [5] that under general hy- 
potheses discrete functions associated to circle packings converge to Riemann maps. 
It may seem that the squared rectangles and the circle packings are not closely re- 
lated, but in some computational examples with the pentagonal and dodecahedral 
subdivision rules we have found that using our squared rectangle software [I] in 
conjuction with CirclePack [7] can lead to dramatic reductions in the total compu- 
tation time for producing the packings. (In an example with over 1,600,000 vertices, 
using both programs led to a reduction in the computation time from almost 38 
hours to under six hours.) We wondered whether the squared rectangles for the 
subdivisions of D\ would define discrete functions which were closely related to the 
Riemann map for D\. 
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But in this example the optimal weight functions appeared to be constant near 
the two sides of the quadrilateral. As we described above, if they really are constant 
then they couldn't possibly give discrete approximations to the Riemann map. We 
found this "apparent" constancy of weights near the sides to be very surprising, and 
decided to look more closely at optimal weight functions for quadrilaterals made 
out of square tiles. 

Example 2.2. Let D be the dumbbell drawn in Figure [TJ The left edge of D is 
the left side of the left ball, and the right edge of D is the right side of the ride ball. 
The bar of D has height 1 and width 8. Figure [6] gives squared rectangles for the 
first three subdivisions of D. For these subdivisions, the weights pi are constant on 
the entire bar and not just on the middle fourth of the bar. 




Figure 6. Squared rectangles for the first three subdivisions of D 



Example 2.3. The dumbbell D 2 shown in Figure [7] is similar to the dumbbell 
D. The two dumbbells have the same bars, but for D 2 the left and right balls are 
smaller. The left edge of D 2 is the top side of the left ball and the right edge of D 2 
is the top side of the right ball. Squared rectangles for the first three subdivisions 
of D 2 are shown in Figure [5J The dumbbell theorem guarantees that the weights 
are constant in the middle fourth of the bar, but here they are constant on most of 
the bar. The quadrilateral D\ is a subcomplex of D 2 . 

da be 



Figure 7. The dumbbell D 2 

3. Weight vectors 

This section deals with basic properties of vectors in Euclidean space. 
We fix a positive integer n, which will be the bar height of a dumbbell under 
consideration. A weight vector is an element of R™ whose components are all 
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FIGURE 8. Squared rectangles for the first three subdivisions of Di 

nonnegative and not all 0. We denote by W the set of all weight vectors in R". 
The height H(x) and area A(x) of a weight vector x = (x\, . . . , x n ) are defined by 



H(x) 



i=i 



and A(x) 



n 
i=l 



Given a positive real number h, we let Wh — • ■ • , and 

W h = {xeW: H(x) = h}. 
We see that w h e Wh and that A(w h ) = 

Lemma 3.1. Let h be a positive real number, let x\,X2 € Wh, and let d\ and di 
be the distances from x\ and X2 to Wh- Then A(x\) < A{x2) if and only if d\ < c?2- 

Proof. Let x G Wh- Since H{x) is the dot product of x with the vector whose 
components are all 1, we have that x ■ (^Wh) = h. Hence x-Wh = ^- = Wh-Wh- So 
the square of the distance from x to w/j is 

(x - Wh) ■ (x - Wh) — x ■ x - 2x ■ Wh + Wh ■ Wh = X ■ X - Wh ■ Wh- 

Thus decreasing A(x) is equivalent to decreasing the distance from 1 to This 
proves the lemma. 

□ 

Corollary 3.2. The weight vector Wh is the unique element ofWh with least area. 

Lemma 3.3. Let x and y be distinct weight vectors such that A(x) > A(y). Then 
the area function restricted to the line segment from x to y is strictly decreasing at 
x. 

Proof. The line segment from x to y is traversed by (y — x)t + x as the parameter 
t varies from to 1. We have that 

A((y-x)t + x) = ((y-x)t+x)-((y-x)t + x) = (y-x)-(y-x)t 2 + 2x-(y-x)t + x-x. 
Viewing this as a function of t, its derivative at t = is 2x ■ (y — x). But since 

x ■ x - 2x ■ y + y ■ y = (x - y) ■ (x - y) > 0, 
we have that 2x ■ y < x ■ x + y ■ y. So 

2x ■ (y — x) = 2x ■ y — 2x ■ x < y ■ y — x ■ x = A(y) — A(x) < 0. 
This proves the lemma. 

□ 
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Lemma 3.4. Every nonempty compact convex set of weight vectors in R" contains 
a unique weight vector with minimal area. 

Proof. Let C be a nonempty compact convex set of weight vectors in R". Since the 
area function is continuous, it has a minimum on C . Let x and y be weight vectors 
in C with minimal area. Lemma 13.31 implies that if x ^ y, then the area function 
restricted to the line segment from x to y is decreasing at x. This is impossible 
because x and y have minimal area, and so x — y. 
This proves Lemma 13.41 

□ 

4. Weight functions which are sums of strictly monotonic cuts 

This section deals with weight functions on rectangles that are viewed as con- 
formal quadrilaterals. Eventually such a rectangle will be chosen to be contained 
in the bar of a dumbbell with the top of the rectangle in the top of the bar and the 
bottom of the rectangle in the bottom of the bar. 

Let R be a rectangle in the plane tiled in the straightforward way by squares 
with n rows and m columns of squares. Let be the square in row i and column j 
for i G {1, . . . , n} and j G {1, . . . , to}. With an eye toward combinatorial conformal 
moduli, we view R as a quadrilateral in the straightforward way. A skinny cut for R 
is strictly monotonic if it contains exactly one tile in every column of R. A weight 
function p on R is a sum of strictly monotonic skinny cuts if p is a nonnegative 
linear combination of characteristic functions of strictly monotonic skinny cuts. 

Lemma 4.1. In the situation of the previous paragraph, the weight function p is 
a sum of strictly monotonic skinny cuts if and only if for every j £ {1, . . . ,m — 1} 
we have that 

n n 
i=l i=l 

and 

k fe+1 fe fe+1 

^2p(Tij) < ^2p(T ij+1 ) and ^p{T lJ+ i) < ^p(Ty) for every k G {1,.. . ,n-l}. 

i=l z=l i=l i=l 

Proof. Suppose that p is a sum of strictly monotonic skinny cuts. Let j G {1, . . . , to— 
1}, and let k G {1, . . . , n— 1}. Then every skinny cut defining p which contains one 
of T\j, . . . ,Tkj also contains one of 7i J+ i, . . . ,Tfe + i J+ i. Since p is a nonnegative 
linear combination of such skinny cuts, it follows that 

fe fe+i 

i=l i=l 

Likewise 

fe fc+i 

i=l i=l 

In the case k = n, every skinny cut defining p which contains one of Ty, . . . ,T n j 
also contains one of Tij+i, ■ ■ • , T n j + i. So 

n n 
i=l i=l 
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The opposite inequality holds by symmetry and so the inequality is actually equal- 
ity. This proves the forward implication of the lemma. 

For the backward implication, suppose that these equalities and inequalities are 
satisfied. We argue by induction on the number ./V of tiles such that piTij) ^ 0. 
The statement to be proved is vacuously true if N = 0. So suppose that N > 
and that the statement is true for smaller values of N. Let j £ {1, . . . , to}. Since 
the zero function is not an allowable weight function and since column sums of 
p-weights are equal, it follows that the sum of the p-weights of the tiles in column 
j is not 0. Let Sj be the tile with the smallest value of i such that p(Tij) ^ 0. 
The inequalities imply that {Si,... , S m } is a strictly monotonic skinny cut. Let 
c = min{p(S'i), . . . , p(S m )}. Let p' be the function on the tiles of R gotten from 
p by subtracting c times the characteristic function of {Si, ... , S m }. If p' is the 
zero function, then p is just c times the characteristic function of {Si, ■ ■ ■ ,S m }. 
Otherwise p' is a weight function which satisfies the inequalities and it has value 
at more tiles than does p. So by induction p' is a sum of strictly monotonic skinny 
cuts. It follows that p is too. 

This completes the proof of Lemma 14.11 

□ 

Lemma 4.2. Let R be a rectangle as above tiled by n rows and m columns of 
squares Tij. Let x = (x\, . . . ,x n ) be a weight vector in R™. Then there exists a 
unique weight junction p for R with minimal area subject to the conditions that p 
is a sum of strictly monotonic skinny cuts and p(Tn) = X\ for every i G {1, . . . , n}. 

Proof. If p is a weight function for R which is a sum of strictly monotonic skinny 
cuts such that p(Tn) = Xi for i S {1, . . . , n}, then the p-height of R is H(x). Thus 
to minimize area, we may restrict to weight functions whose weights are all at most 
H(x). Lemma 14 . 1 1 shows that we are minimizing area over a compact convex subset 
of R nm . This subset of R nm is nonempty because it contains the weight vector 
corresponding to the weight function p for R for which p{Tij) — xt for every i and 
j. Now Lemma [331 completes the proof of Lemma l4~2l 

□ 

5. The skinny cut function 

We maintain the setting of Section [H and continue to let W denote the set of 
all weight vectors in R". Lemma l4~2l allows us to define a function $ : W — > W as 
follows. Let x — (jci, . . . , x n ) £ W. We apply Lemma in the case where m = 2, 
so that the rectangle R has n rows and only two columns. Then there exists a 
unique weight function p for R with minimal area subject to the conditions that p 
is a sum of strictly monotonic skinny cuts and p(Tn) = xi for i £ {1, . . . , n}. Let 
y = (yi, . . . , y n ) be the weight vector such that yi = p{Ti2) for i £ {1, . . . , n}. We 
set Q(x) — y. This defines <!>, which we call the skinny cut function. This section 
is devoted to the investigation of 4>. 

Let x £ W, and let h = H(x). Lemma |4~T1 implies that H($(x)) = h, so $ maps 
Wh into WV Moreover it is not difficult to see that if r is a positive real number, 
then $(rx) = r$(x). 

Let ftbea positive real number, and let x £ Wh- From x we obtain real numbers 
Po = < pi < p 2 < ■ ■ ■ < Pn = h by setting p k = £\ =1 Xi for k £ {0, ... , n). We 
denote (po, . . . ,p n ) by n(x) and we call it a weak partition of the closed interval 
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[0, h] as opposed to a strict partition in which the numbers po, . . . ,p n are required 
to be distinct. We call pk the fcth partition point of x. This correspondence gives 
a bijection between Wh and the set of all weak partitions of [0, h]. 

The inequalities of Lemma 14.11 can be easily interpreted in terms of partition 
points. Let R be a rectangle as before with n rows and two columns of tiles. Let 
p be a weight function on R which is a sum of strictly monotonic skinny cuts. Let 
x = (xt, ...,x n ) and let y = (y x , . . . , y n ), where x t = p(T 41 ) and y l = p(T i2 ) for 
each i. Let (po, . . . ,p n ) — n(x), and let (go, ■ ■ ■ , Qn) = ""(?/)■ Lemma HTTI implies that 
H(x) = H(y). Then, assuming that H(x) — H(y), the inequalities of Lemma |4~T1 
are equivalent to the inequalities qk-i < Pk < Qk+i for every k 6 {1, . . . ,n — 1}. 
Likewise, assuming that H(x) = H(y), the inequalities of Lemma 14 . 1 1 are equivalent 
to the inequalities Pk-i < Ik < Pk+i for every i 6 {1, . . . , n — 1}. We say that two 
weight vectors x and y are compatible if H(x) = H(y) and qu—i < Pk < 9fc+i> 
equivalently p k -i < qk < Pk+i, for every fee {1, . . . , n - 1}, where (p , ...,p n ) = 
n(x) and (qo, • ■ ■ ,q n ) — ft(y)- This gives us the following reformulation of the 
definition of $, formally stated as a lemma. 

Lemma 5.1. If x is a weight vector, then is the weight vector with minimal 
area which is compatible with x. 

Let x — (xi, . . . , x n ) £ W, and let {po, . . . ,p n ) — n(x). Let k € {1, . . . , n — 1}. 
We call pk a left leaner for x it Xk > Xk+i, and we call pk a rig/i£ leaner for a; if 
Xfe < Xfc+i. The rest of this paragraph explains this terminology. Suppose that pk 
is a left leaner for x for some k € {1, . . . ,n — 1}. Then a;*; > Xk+i- The function 
/(t) = (t — Pk-i) 2 + {Pk+i — t) 2 is convex on the closed interval \pk-i, Pk+i] with a 
minimum at the interval's midpoint. This and the inequality x% > Xk+i imply that 
decreasing pk while fixing all other partition points of x decreases the area of x. So 
we view pk as leaning left toward a position of less area for x. A similar discussion 
holds for right leaners. 

Lemma 15. II and the previous paragraph imply for every x € W that A(<f>(x)) < 
A{x) unless x has no leaners. But if x has no leaners, then x — Wh, where h — H(x). 
Corollary 13.21 implies that $>(wh) — Wh- Thus for every positive real number h the 
restriction of $ to Wh has a unique fixed point, namely, Wh- We formally state 
some of the results just proved in the following lemma. 

Lemma 5.2. For every positive real number h the restriction of $ to Wh is a 
function $ | w : Wh — ► Wh which reduces areas of weight vectors except for at the 
unique fixed point, Wh ■ 

Now let x and y be compatible weight vectors with (po, ■ ■ ■ ,p n ) = 7r(x) and 
(qo, . . . , q n ) — 7r(y)- Suppose that qk is a left leaner for y for some k 6 {1, . . . , n— 1}. 
We say that x blocks qk if qk = Pk-i- The motivation behind this terminology is 
that for x and y to be compatible the inequality qk > Pk-i must be satisfied, and 
so even though qk is leaning left, x prevents us from decreasing qk to decrease the 
area of y. Similarly, if qk is a right leaner for y, then we say that x blocks qk if 
qk = Pk+i- 

Lemma 5.3. Let x and y be compatible weight vectors. Then every left leaner for 
y which is blocked by x is a left leaner for x. Similarly, every right leaner for y 
which is blocked by x is a right leaner for x. 
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Proof. Suppose that x = (xi, . . . ,x n ), y = (yi, ■ ■ ■ ,y n ), n(x) = (po, . • . ,p n ), and 
7r (y) = {<lo> ■ ■ ■ , Qn)- Suppose that qk is a left leaner for y blocked by x for some 
k e {1, . . . , n — 1}. This means that > y^+i and that qk — Pk-i- Since g^-i > 
Pfe-2, we have j/fc < Xfc_i. Since gt+i > pk, we have y^+i > Xfc. Combining these 
inequalities, we obtain that x^-i > 2/& > 2/fe+i > Xfc, and so pk-i is a left leaner 
for x. Hence every left leaner for y which is blocked by a; is a left leaner for x. A 
similar argument holds for right leaners. 
This proves Lemma 15.31 

□ 

Lemma 5.4. Let x £ W. TTien every left leaner for $(x) is a Ze/t leaner for x 
blocked by x, and every right leaner for 3>(x) is a right leaner for x blocked by x. 
Conversely, if y is a weight vector compatible with x such that every leaner for y is 
blocked by x, then y = $(x). 

Proof. Let q be a left leaner for $(x). Then q is blocked by x, for otherwise it is 
possible to decrease A($(x)) by decreasing q. Thus every left leaner for $(x) is 
blocked by x. Now Lemma [5731 implies that every left leaner for $(x) is a left leaner 
for x. Similarly, every right leaner for $(x) is a right leaner for x blocked by x. 

Now suppose that y is a weight vector compatible with x such that every leaner 
for y is blocked by x. Suppose that y 7^ $(x). All weight vectors on the line 
segment from y to $(x) are compatible with x. Lemma T3.3I implies that the area 
function restricted to this line segment is strictly decreasing at y. But as we move 
the partition points of y linearly toward the partition points of $(x) we either move 
leaners away from blocked positions, which increases area, or we move nonleaners, 
which also increases area. Thus y — <5(x). 

This proves Lemma 15.41 

□ 

We next introduce the notion of segments. Let x £ Wh for some positive real 
number h, and let ir(x) = (po, . . . ,p n )- Let i and j be indices such that pi is either 
or a leaner of x, pj is either h or a leaner of x, i < j and pk is not a leaner 
of x if i < k < j. Then Xj+i = • • • = Xj, x, 7^ Xi+i if i > 0, and Xj+i 7^ Xj if 
j < n. We call (x^+i, . . . , Xj) a segment of x. In other words, the leaners of x 
parse x into segments such that the coordinates of x in every segment are equal, 
and segments are maximal with respect to this property. We call pi and pj the 
endpoints of the segment (xj+i, ...,Xj). By the value of a segment we mean the 
value of any component of x in that segment. By the dimension of a segment we 
mean the number of components in it. By the height of a segment we mean the sum 
of its components. Each endpoint of a segment is either a leaner or an endpoint 
of the interval [0, h). If the left endpoint of a segment is a left leaner, then we say 
that it leans away from the segment, and if it is a right leaner, then we say that it 
leans toward the segment. The situation is similar for right endpoints. 

Let y be a weight vector in the image of <E>. A weight vector x is called a minimal 
preimage of y if it satisfies the following: 

i) $(x) = y. 

ii) If x' is a weight vector with $(x') = y, then A(x) < A(x'), with equality if 
and only if x = x' . 
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Lemma 5.5. Let y be a weight vector, and let ti,...,t m be the segments of y in 
order. Then y is in the image of $ if and only if there is a weight vector x with 
segments s±, . . . , s m in order which satisfies the following: 

(1) The height of Si equals the height of ti for i G {1, . . . , m}. 

(2) For every i G {l,...,m} the dimension of Si is the dimension of ti plus 
en — Ti where cti is the number of endpoints of ti which lean away from ti 
and Ti is the number of endpoints of ti which lean toward U . 

Furthermore, if y is in the image of $ then this vector x is unique and is the 
minimal preimage ofy. 

Proof. We first suppose that y is in the image of and let w be a weight vector 
such that = y. Suppose that the right endpoint of t\ leans toward t\, namely, 

that it is a left leaner. Suppose that the right endpoint of t\ is partition point k ofy. 
Lemma |5 .41 implies that this left leaner is blocked by w, and so it is partition point 
number fc— 1 of w. This proves that if the right endpoint of t\ leans toward t\, then 
the dimension of t\ is at least 2. Similarly, if the left endpoint of the last segment 
t m leans toward t m , then the dimension of t m is at least 2. More generally, if both 
endpoints of some segment of y lean toward that segment, then the dimension of 
that segment is at least 3. With the results of this paragraph and the fact that 
53i=i( Q! i — T i) — 0; we see that conditions 1 and 2 in the statement of Lemma 15.51 
uniquely determine a weight vector x. 

For the converse, suppose that a: is a weight vector with segments s\, .. . ,s m in 
order which satisfies conditions 1 and 2 of the lemma. An induction argument on 
i shows that the endpoints of ti other than and h are blocked by x and that x is 
compatible with y. By Lemma l5.4| = y and so y is in the image of $. 

For the last statement, suppose that y is in the image of <!>. Let x be the weight 
vector which satisfies conditions 1 and 2 of the lemma, and let x' be a weight vector 
with = y. Let i 6 {1, . . . , m}. Because the endpoints of ti other than and h 

are blocked by x' , the number of components of x' between these endpoints equals 
the number of components of x between these endpoints, namely, the dimension 
of Si. But since the components of Si are all equal, Corollary 13.21 implies that the 
area of Si is at most the sum of the squares of the corresponding components of x' . 
Thus A(x) < A(x'), and equality holds if and only if x = x' . 

This proves Lemma 15.51 □ 

This section has thus far been concerned with the definition of <£> and basic 
relationships between a weight vector and its image under We now turn to area 
estimates. The next result, Theorem 15.61 prepares for Theorem 15.71 which is the 
key ingredient in our proof of the dumbbell theorem. 

Theorem 5.6. Let h be a positive real number, and let x £ W^. Then there exists 
a weight vector w € Wh satisfying the following conditions. 

(1) Both w and &(w) have at most two segments with nonzero values. Further- 
more, if they have two segments with nonzero values, then these segments 
are consecutive. 

(2) The weight vector w is the minimal preimage of <L>(w). 

(3) A(w) < A(x). 

(4) A(<P(w)) > A{$(x)). 
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Proof. Set 

W = {z G W h : A{z) < A(x) and A(<5>(z)) > A($(x))}. 

Since every element of W satisfies conditions 3 and 4, it suffices to prove that W 
contains a weight vector satisfying conditions f and 2. We will do this by first 
proving that W contains an element with minimal area, ft follows that w is the 
minimal preimage of <f>(w), giving condition 2. We then prove that w satisfies 
condition 1. 

In this paragraph we prove that W contains an element with minimal area. Since 
x £ W, W is not empty. The set Wh is compact, and so the closure W of W in Wh 
is compact. Since the area function is continuous, there exists a weight vector w in 
W with minimal area. Because the area function is continuous and w S W, we have 
that A(w) < A(x). It remains to prove that A($(to)) > A(&(x)). For this let w' be 
a weight vector in W near w. Lemma 15-11 shows that $(iu) is compatible with w. 
Because w' is near w, there exists a weight vector y near $(u>) which is compatible 
with w' . Because y is near the area of y is not much larger than the area 

of <&(u>). Now Lemma 15.11 shows that the area of $(«/) is not much larger than 
the area of Since A($(u/)) > A($(x)), it follows that A($f») > A(®(x)). 

Thus w € W. This proves that W contains an element with minimal area. 

We fix an element w in W with minimal area. Lemma 15.51 implies that w is the 
minimal preimage of <f>(w). To prove Theorem l5.6l it suffices to prove that w cannot 
have nonconsecutive segments with nonzero values. We do this by contradiction. 
So suppose that w has nonconsecutive segments with nonzero values. Let sx,...,s m 
be the segments of w in order with dimensions d%, . . . , d m , values v\, . . . , v m , and 
heights hi, . . • , h m , so that hi = diVi for every % € {1, . . . , m}. 

In this paragraph we show that w has two consecutive segments with nonzero 
values. We proceed by contradiction: suppose that w does not have two consecutive 
segments with nonzero values. Then every other value of w is 0. Let i € {1, . . . , to} 
such that Vi ^ 0, Uj+i = 0, and Vi+2 ^ 0. Because of the symmetry with respect to 
the order of the components of w, we may, and do, assume that vi < Vi+2- Let Aq 
be the sum of the areas of the segments of w other than Sj, s^+i, and Then 

h 2 h 2 

Similarly, if Bq is the sum of the areas of the segments of §(w) other than those 
corresponding to Sj, Si+i, and Si+2, then 



A($(«0) = B 

where 



and 

e,:+2 



dj + 1, if i = 1 
di + 2, if t > 1 

di_(_2 + lj if i = to 
dj + 2 + 2, if i < to. 



We construct a new weight vector w' by modifying w as follows. Where if has 
segments s;+i, s^ + 2 with heights ft.^, 0, /ii+2 and dimensions di, di+i, c?i+2, 
the weight vector w' has segments with heights 0, ftj, h i+2 and dimensions d-, 
+ 1, di + 2- The heights and dimensions of all other segments of w' equal the 
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corresponding heights and dimensions of w except that if i > 1, then w' has one 
fewer segment than w because segment Si-x of w has value and so the components 
of to in Sj+i must be adjoined to Sj_i. So 

A{w') = A Q + -M- + f j ±1 < A(w) < A(x). 
Using Lemma 15.51 we see that 

A(Mw')) =B Q + -4t + Mt_ > A(Mw)) > A(*(x)). 
di + 1 e l+2 

This contradicts the fact that w is an element in W with minimal area. Thus w has 
two consecutive segments with nonzero values. If w only has two segments with 
nonzero values, then we are done. Hence we may assume that w has at least three 
segments with nonzero values. 

Next suppose that w' is a weight vector with the same number of segments as 
w, the segments of w' have the same dimensions as the corresponding segments of 
w but the leaners of w' are gotten by perturbing the leaners of w, equivalently, the 
heights of the segments of w' are gotten by perturbing the heights of the segments 
of w. If this perturbation is small enough, then Lemma [5.41 implies that $(it/) is 
gotten from by exactly the same perturbation. We will prove that there exists 
such a perturbation such that A{w') < A{w) and A($(w')} = A($(w)). Thus w' 
is an element of W with smaller area than w, a contradiction which will complete 
the proof of Theorem 15.61 

To begin the construction of such a perturbation, we note that the area of w is 

i—l i—1 ^ 1 ' i—1 1 i—1 

where cti = d^ 1 for every i 6 {1, . . . , m}. Recall that w is the minimal preimage of 
$(w). As above, if ti, . . . ,t m are the segments of 4>(w) in order with dimensions 
A^ 1 , . . . , A" 1 , then the area of is 

m 

»=i 

This leads us to define functions / : R m — » R and _F : R m — > R so that 

m 

/(xi, . . . ,x m ) = £aj(a;j + 

i=l 

and 

m 
i=l 

To prove Theorem 15. 6[ it suffices to find points x — (x±, . . . ,x m ) E R m arbitrarily 
near the origin such that J^iLi x i = 0, f(x) < /(0), F(x) — F(Q), and X{ > if 
hi = 0. 

We do this while fixing all but three components of {x\, . . . , x m ). We spec- 
ify these components later in this paragraph. Since only three components of 
(xi,...,x m ) are allowed to vary, the functions / and F are really functions of 



SQUARING RECTANGLES FOR DUMBBELLS 



15 



three variables. To simplify notation, we now view / and F as functions from R 3 
to R with 

f(x, y, z) = a(x + x ) 2 + b(y + y ) 2 + c(z + z Q ) 2 + d 

and 

F(x, y, z) = A(x + x f + B(y + y ) 2 + C(z + z ) 2 + D. 

Since w has two consecutive segments with nonzero values, by Lemma 15.51 <f>(w) 
has two consecutive segments with nonzero values. Let the variable x correspond 
to a segment of $(u>) with nonzero value which is adjacent to a segment with 
larger value. Let z correspond to a segment of $(u>) with maximal value. Let y 
correspond to any segment of $(w) with nonzero value not already chosen. Note 
that Xq, 2/0 1 z o > 0. Because the segment of $(u>) corresponding to x has an endpoint 
leaning away from it, its dimension A -1 is at most as large as the dimension a -1 
of the corresponding segment of w. Thus < a^ 1 , and so A > a. Because 
the segment of $(«;) corresponding to z has no endpoint leaning away from it, 
C _1 > c _1 , and so C < c. Because the value of the segment of <£>(u>) corresponding 
to z is larger than the value of the segment corresponding to x, we have Czq > Axq. 
To prove Theorem [52] it suffices to find points (x, y, z) E R 3 arbitrarily near (0, 0, 0) 
such that x + y + z = 0, f(x, y, z) < /(0, 0, 0), and F(x, y, z) = F(0, 0, 0). 

The set of all solutions (x, y, z) to the equation f(x, y, z) = /(0, 0, 0) is an ellip- 
soid containing the point (0, 0, 0). The gradient of / at (0, 0, 0) is (2axo, 2byo, 2czq). 
Because axo, byo, czq are not all equal, the plane given by x + y + z = con- 
tains the point (0,0,0), but it is not tangent to the ellipsoid. Thus this plane 
intersects the ellipsoid in an ellipse. It likewise intersects the ellipsoid given by 
F(x,y,z) = ^(0,0,0) in an ellipse. These ellipses both lie in this plane and they 
have a point in common. If they intersect transversely at (0, 0, 0), then it is easy to 
find points arbitrarily near (0, 0, 0) which lie within the first ellipse and lie on the 
second one. In other words, Theorem l5.6l is true if the ellipses intersect transversely. 

Finally, we assume that the ellipses are tangent at (0, 0, 0). Let L be the line in 
R 3 tangent to the ellipses at (0, 0, 0). Then L lies in the plane given by x + y + z = 
and it lies in the tangent planes to both of the ellipsoids. These three planes have 
normal vectors given by (1,1,1), (clxq, by$, czq) and (Axq, By$, Czq). Thus these 
three vectors are linearly dependent. Thus the same is true for the columns of the 
matrix 

1 axo Axq 
1 by Q By . 

1 CZq Czq 

The cross product of any two rows is orthogonal to all three rows. The cross product 
of row 1 and row 3 is 

v = ({aC — Ac)xqZq, Axq — Czq, czq — axo). 

Since a < A and C < c, we have (aC — Ac)xqzq < 0. Since Czq > Axq, we have 
Axq — Czq < 0. Since v is orthogonal to the rows of the preceding matrix, it is also 
orthogonal to the rows of 



-J_ 


a 


A 


xo 








h 


B 


Ho 








c 


C 


- 20 
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So 

(aC-Ac)x z (—,—,— ) +(Axo-Czo)(a,b,c) + (cz -ax )(A,B,C) = (0,0,0). 

Solving for (a, 6, c), we find that 

(a,b,c)=r(— ,-,-) +s(A,B,C) 

for some real numbers r and s with r < 0. Viewing f(x, y, z) and F(x, y, z) as dot 
products of vectors except for the final additive constants, we have that 

f(x, y,z)- 7(0,0,0) 

= (a, b, c) ■ ((x + x ) 2 , (y + y ) 2 , (z + z ) 2 ) - (a, b, c) ■ (xq, y%, zl) 
= (a, 6, c) ■ (x 2 + 2x x, y 2 + 2y y, z 2 + 2z a z) 

— ,—,— ) • (x 2 + 2x x,y 2 + 2y y,z 2 + 2z z) 
x Vo z J 

+ s(A, B, C) ■ (x 2 + 2x x, y 2 + 2y y, z 2 + 2z z) 

+ s[F(x,y,z)-F(0,0,0)} 



—x 2 + —y 2 + —z 2 ) +2{x + y + z) 
xo y z / 

\x y z J 

So if (x,y,z) is any point on the ellipse corresponding to F, then F(x, y, z) — 
F(0, 0, 0) = and so f(x, y, z) - f(0, 0, 0) < 0, as desired. 
This proves Theorem 15.61 

□ 

Theorem 5.7. Let h be a positive real number. Let X\, . . . ,Xs n be weight vectors 
in W/i such that A(a;,+i) < A(<P(xi)) for every i G {1, .. .,3n— 1}. Then x^ n = Wh- 



Proof. Lemma 15.21 implies that <L> does not increase area, and so 

A{xi) >■■■> A(x 3n ). 

Lemma I5~2l and Corollary 13 . 21 furthermore imply that if two of x±, . . . , x 3n have the 
same area, then x^ n = Wh- So we may, and do, assume that x\,..., x^ n are distinct. 

Since <!>{hx) = h$(x) for every weight vector x € VVi, we may, and do, assume 
that h = 1. 

Let i € {l,...,3n — 1}. Theorem 15.61 implies that there exists a weight vector 
x\ 6 Wi such that both x\ and l P(x' i ) have at most two segments with nonzero 
values, the weight vector x\ is the minimal preimage of <P(x' i ), A^x^) < A(xi) and 
A^ix'i)) > A($(xi)). Set x' 3n = x 3n . Then 

A(x' i+1 ) < A(x l+1 ) < A(${ Xi )) < A^ix'A) 

for every i € {1, . . . , in— 1}. Thus to prove Theorem l5.7[ we may replace x\, . . . , x?, n 
by x' 1; . . . , x' 3n . In other words we may, and do, assume for every i S {1, . . . , 3n— 1} 
that Xi and <P(xi) do not have nonconsecutive segments with nonzero values and 
that Xi is the minimal preimage of <P(xi). 

At this point the argument splits into two cases which depend on how many 
of the weight vectors x\, . . . , x^n have segments with value 0. In Case 1 we prove 
Theorem 15 . 71 under the assumption that at least n of the weight vectors Xi, . . . , x 3n 
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have a segment with value 0. In Case 2 we prove Theorem 15 . 71 under the assumption 
that fewer than n of the weight vectors x±, . . . , X3 n have a segment with value 0. 

We begin Case 1 now. Assume that at least n of the weight vectors x±, . . . , x$ n 
have a segment with value 0. Let z\, ■ ■ . , z n be distinct elements of {xi, . . . , x^n} 
in order each of which has a segment with value 0. Then A(zi+i) < A{<P{zi)) for 
every i G {1, . . . , n — 1}. 

In this paragraph we show that in Case 1 it suffices to prove for every i S 
{1, . . . , n — 1} that 

(5.8) A(z i+1 )(A(zi) + 1) - A( Zi ) < 0. 
Indeed, this inequality is equivalent to the inequality 

A{z * +l) - A(z0-i + r 

Since H(zi) = 1, we have < 1. A straightforward induction argument 

based on the preceding displayed inequality shows that A(zi) < 1/i for every 
i € {l,...,n}. In particular A(z n ) < 1/n. But Corollary 13.21 implies that w\ 
is the unique element in Wi with area at most Hence z n = x^ n = w±. Thus to 
prove Theorem 15. 71 in Case 1 it suffices to prove line 15. 81 for every t 6 {l,...,n-l}. 

In this paragraph we assume that there exists i G {1, . . . ,n — 1} such that Zi 
has only one segment with nonzero value. Let d be the dimension of this segment. 
Then A{zi) = l/d. Moreover <P(zi) also has just one segment with nonzero value 
with dimension either d+ 1 or d + 2. Hence A(<l>(zi)) is either l/(d+ 1) or l/(rf + 2). 
So line 15.81 holds. Thus to prove Theorem 15. 71 in Case 1 it suffices to prove line 15.81 
under the assumption that Zi has two segments with nonzero value. 

So let i £ {1, ... ,n — 1}, and suppose that z% has two consecutive segments s 
and t in order with nonzero value. Let p be the right endpoint of s, equivalcntly, 
the left endpoint of t. Let d be the dimension of s, and let e be the dimension of t. 
The value of s is p/d, and the value of t is (1 — p)/e. It follows that 

(5.9) A(zi) = d~y + e- 1 (l-p) 2 . 

Let d 1 and e' be the dimensions of the segments s' and t' of ${zi) corresponding to 
s and t. Then 

A(${z i )) = {d')-V + {e')- 1 {l-p) 2 . 
By symmetry we may assume that 

p_ < l-p 
d' ~ e' 

The last inequality implies that p is a right leaner for Hence Lemma [5.41 

implies that p is a right leaner for Zi and so 

p < l-p 

d ~ e 

Combining this with line 15.91 yields that 

A(zi) = d^p 2 + e-^l - pf < e"V(l - p) + e- x (l - p) 2 = e- x (l - p). 
Similarly, we have that 

(5.10) A{<£{ Zi ))<{e')- l {l-p). 

Because Zi has a segment with value and Zi is the minimal preimage of <&(zi), it 
is also true that $(zi) has a segment with value 0. These facts and Lemma [531 imply 
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that if <l?{zi) has a segment with value preceding s', then d! = d and e' > e + 1. 
If 4>{zi) does not have a segment with value preceding s', then d' = d — 1 and 
e' = e + 2. 

Suppose that <P(zi) has a segment with value preceding s'. The previous 
paragraph shows that d' = d and e' > e + 1. Combining this with line 15.101 yields 
that A(${zi)) < (e + l) _1 (l-p). Hence 

< A($(zi))A(zi) + A(#(zj)) - A(z() 



< 



1 >2 , 1 2 , 1 n ^2 1 o 1 



< 



e(e + l) 



< o. 

Thus line 15.81 is true if $>{zi) has a segment with value preceding s' . 

Next assume that <P(zi) does not have a segment with value preceding s'. Then 
the next-to-last paragraph shows that d' = d — 1 and e 1 = e + 2. Combining this 
with line [5TTU1 yields that A($(zi)) < (e + 2)~ 1 (1 -p). We also have that 

P P < ±-P 1 ~P 



d-1 d' - e> e + 2 
Hence 

A(z i+1 )(A(zi) + l)-A(zi) 

<A($(zi)){A(zi) + l)-A(zi) 

< A($(zi))A(zi) + A($( Zi )) - A( Zi ) 



< 



1 (i-p) 2 +ip 2 +^- P ) 2 --y--^-p) 2 



e(e + 2) v ^ d'* e' v ^ tT e 

< - p) 2 + ^-y +^-m-p) 2 - -/ - -a - p) 2 

e(e + 2) a — 1 e + 2 a e 

s - p)2 + v^w/ - w^T) f 1 - *> 2 

£ ^< 1 -*> 2 + ^< 1 -*> 2 -;^h)' 1 - J, > 2 

< o. 

Thus line 15.81 is true if ^>(zj) does not have a segment with value preceding s' . 

The proof of Theorem 15.71 is now complete in Case 1, namely, Theorem 15.71 is 
proved if at least n of the weight vectors x±, . . . , x^ n have a segment with value 0. 

Now we proceed to Case 2. Suppose that fewer than n of the weight vectors 
xi, . . . , X3 n have a segment with value 0. Then at least 2n of the weight vectors 
xi, . . . , X3 n do not have a segment with value 0. Let Z\, . . . , Z2 n be distinct elements 
of {x\, . . . , X3 n } in order none of which has a segment with value 0. Then A(zi+i) < 
A{<P(zi)) for every i E {!,... ,2n — 1}. As we have seen, to finish the proof of 
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Theorem 15. 7\ it suffices to prove that zi n — w\ . We do this by contradiction. 
Suppose that z% n ^wi- As we have seen, it follows that z« ^ w\ for i € {1, . . . , 2n}. 

Let i £ {1, . . . , 2n — 1}. Then z, has at most two segments, Zj has no segment 
with value 0, and Zj is the minimal preimage of <?(zi). The only weight vector in 
Wi with one segment is wi, so Zj has two segments neither of which has value 0. 

Let the first segment of Zj have right endpoint p and dimension d. Then the 
second segment of Zj has left endpoint p and dimension n — d. So 

^ _ V + ' a -,)« = ft-^ + -'('-rt' 

a (n — a) d(n — a) 

np 2 — dp 2 + d — 2dp + dp 2 ra 2 p 2 — 2dnp + 

(np — d) 2 — d 2 + dn (d — np) 2 1 
d(n — d)n d(n — d)n n 

(d — np) 2 



Set 



B(zi) 



d(n — d) 



Then 



A{ Zi ) = -B( Zi ) + -. 
n n 

Since Zj has two segments neither of which has value 0, the same is true of <P(zi). 
We define B(<l>(zi)) in the same way that we define -B(zj). Then the given inequality 
A(z i+ i) < A(<P(zi)) is equivalent to the inequality B(z i+1 ) < B('P(z i )). We focus 
on this latter inequality. 

Let a be the positive real number such that fi(zj) = a 2 . Then 



(d — np) 2 



a 2 



d(n — d) 
(d — np) 2 = a 2 d(n — d) 
d 2 - nd + a~ 2 (d - np) 2 = 

(d-^) 2 +a- 2 (d-np) 2 = ^ 

We are led to consider the family £ of all ellipses of the form x 2 + a~ 2 y 2 = 1, 
where a is a positive real number. The open upper halves of the ellipses in £ fill 
the open half infinite strip S — {(x,y) & R 2 : \x\ < 1, y > 0}. 

The weight vector Zj determines the point 

/ n \ (Id 2d 
P(zi) = {x hyi ) = 1, 2p 



v n n 

in R 2 , and we next show that we may assume that P(zj) € S. It is easy to see that 
\xi\ < 1, so what is needed is the inequality yi > 0. Since Zj and $(zj) have exactly 
one leaner, by symmetry we may, and do, assume that this leaner is a right leaner. 
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Hence 



p(n - d) < d(l 
pn < d. 




P) 
P) 



It follows that yi > 0, and so P{zi) £ S. It is even true that yi < xi + 1, and so 
P(zi) lies in the triangle T = {(x, y) £ R 2 : \x\ < 1, < y < x + 1}. 

The point of R 2 corresponding to <P(zi) likewise lies in T. Moreover since the 
first segment of Zi has right endpoint p and dimension d, it follows that the first 
segment of ^{zi) has right endpoint p and dimension d — 1. In other words, the 
point of T corresponding to ^(zi) is 

\ n n J \ n n J 

Now we can reformulate our problem as follows. Let r : R 2 — > R 2 be the transla- 
tion defined by t(x, y) = (x — ^, y — -). We begin with a point t\ £ T. Let E\ be 
the ellipse in £ containing t-y. We have that r(ti) £ T and r(ti) lies within E\. Let 
be the ellipse in £ containing r(ti). Let ti be a point of T either on or within 
E[. We iterate this construction to obtain points t±, . . . , t-2 n £ T. To complete the 
proof of Theorem 15.71 it suffices to prove that it is impossible to construct points 
ti , ■ ■ ■ , m this way. 

To this end, let A be the subset of R 2 which is the union of the closed line 
segment joining (0,0) with (0,1) and the closed line segment joining (0,1) with 
(1,2). See Figure EE We define a function a: T -> R as follows. Let t £ T. There 
exists a unique ellipse E £ £ which contains i, and A fl E consists of a single point. 
Let a(t) be the y-coordinate of this point. 




(1,2) 



(-1,0) 



(0,0) 



(1,0) 



Figure 9. Defining a(t). 



We will prove that if ti, ■ ■ ■ , tin are points in T as in the next-to-last paragraph, 
then a(U+i) < a{U) — 1/n for every i £ {1, . . . , 2n — 1}. This suffices to complete 
the proof of Theorem 15.71 Because ii+i lies either on or within the ellipse in £ 
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containing r(fcj), it actually suffices to prove that a(r(ti)) < a(ti) — 1/n for every 
i € {1, . . . , In — 1}, which is what we do. 

So let % £ {1, . . . , 2n — 1}. Let Ei and E[ be the ellipses in £ which contain ti 
and r(ti). 

First suppose that the x-coordinates of both ti and t(U) are nonnegative. See 
Figure[lOl In this case it is clear that we actually have that a(r(ij)) < aiti) — 2/n, 
as desired. 




(1,2) 



(-1,0) (0,0) (1,0) 

Figure 10. When the ^-coordinates of both U and t(U) are nonnegative. 

Next suppose that the x-coordinates of both ti and r(ti) are nonpositive. See 
Figure [Til Since x 2 + a~ 2 y 2 = 1, we have that y 2 — a 2 (l — x 2 ). Differentiating this 

(0.1) 




(-1,0) (0,0) (1,0) 

Figure 11. When the x-coordinates of both U and t(U) are nonpositive. 

with respect to x yields that 2yy' = —2a 2 x. Solving for y' and replacing a 2 with 
y 2 /{l — x 2 ) yields that 
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Since y < x + 1 and x/(x 2 — 1) > if —1 < x < 0, we have that y' < x/(x — 1). 
It follows that y' < 1/2 throughout T. This implies that the vertical distance from 
ti to E[ is at least 1/n. Line 15. Ill implies that Ei increases more than E[ as they 
approach A from the left. It follows that a(r(ti)) < a(ti) — 1/n, as desired. 

Finally, if the x-coordinate of U is positive and the x-coordinate of r(ij) is 
negative, then an argument similar to that in the previous paragraph shows again 
that a{r(ti)) < a(ti) — 1/n. 

This completes the proof of Theorem 15. 71 

□ 

6. The proof of the dumbbell theorem 
We prove the dumbbell theorem in this section. 

Let D be a dumbbell. Let p be a fat flow optimal weight function for D. Let 
h = H p . We scale p so that it is a sum of minimal skinny cuts, making h an integer. 
Suppose that D has bar height n. Let C±, . . . , Cj, n be the leftmost 3n columns in 
B in left-to-right order, and let D%, . . . , D^ n be the rightmost 3n columns in B in 
right-to-left order. 

For every i G {1, . . . , 3n}, applying p to the tiles in d obtains a weight vector 
yi G R n and applying p to the tiles in Di obtains a weight vector Zi G R". From 
the assumption that H p — h, it follows that H(y{) > h, and so by scaling yi we 
can obtain a weight vector X\ G Wh such that A{x\) < A(y\). Let j be the largest 
element of {1, . . . , 3n} such that there exist weight vectors x\, . . . ,Xj £ Wh with 
A{x\) < A(yi) and if j > 1, then A(xj) > A(yj) and A(x i+1 ) < A{$(xi)) for every 
i G {1, . . . , j — 1}. Similarly, let k be the largest element of {1, ... , 3n} such that 
there exist weight vectors xi,...,Xk G with A(xi) < A(z\) and if k > 1, then 
A(x k ) > A(z k ) and A{x i+ \) < A($(xi)) for every i G {1, . . . ,k — 1}. 

In this paragraph we define a new weight function a on D. Let t be a tile 
of D. If t is contained in either one of the balls of D or one of the columns 
Ci, . . . , Cj-x,Dk-i, ■ • ■ , D\, then a(t) — p(t). If t is in B and strictly between C^ n 
and D^ n , then a(t) = h/n. Now suppose that t is contained in Cj. We have that 
p is a sum of h minimal skinny cuts. We define a(t) to be the number of these 
skinny cuts c with the property that as c proceeds from the left side of D to Cj, 
the tile t is the first tile of Cj in c. This defines a(t) for every tile t in Cj. These 
values determine a weight vector Xj G in the straightforward way. If j < 3n, 
then we use the weight vector Xj+i = ^(xj) in the straightforward way to define 
a(t) for every tile t in Cj+\. We inductively set Xi+\ = &(xi) and use this weight 
vector in the straightforward way to define a(t) for every tile t in Ci+i for every 
i G {j + 1, . . . , 3n — 1}. This defines a(t) for every tile t in Cj, ... , C^ n . We define 
cr(t) for every tile t in Z?^, . . . , D^ n analogously. The definition of a is now complete. 

By assumption there exist weight vectors x\, . . . ,Xj G Wh with A{x\) < A(yi) 
and if j > 1, then A(xj) > A(yj) and A(x i+ i) < A(<P(xi)) for every i G {1, .. ., j — 
1}. In defining a we constructed weight vectors Xj, a^j+i, ■ ■ ■ , x$ n G Wft with Xj+i — 
&(x~j) and Xi+i = <P(xi) for every i G {j + 1, . . . , 'in — 1}. We redefine Xj to be Xj. 
Then xi, . . . , X3„ are weight vectors in Wh such that A(xj+i) < A(#(xj)) for every 
i G {1, . . . , 3n— 1}. Theorem 15 . 71 implies that x 3n = Wh- The situation is analogous 
at the right end of B. 

In this paragraph we prove that H a = h. Because a is constant on the union of 
the columns of B between C% n -\ and Z?3„_i, the restriction of a to these columns 
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is a sum of skinny cuts. The definition of <P implies that these skinny cuts can be 
extended to the columns of B from Cj to Dk so that the restriction of a to the 
columns from Cj to Dk is a sum of skinny cuts. The definition of a implies that 
these skinny cuts can be extended to all of D so that a is greater than or equal to 
a weight function which is a sum of h skinny cuts. We conclude that H a > h. But 
since the columns of D from Cj to Dk have cr-height h, it is in fact the case that 
H„ = h. 

In this paragraph we prove that A„ = A p . We proceed by contradiction. Suppose 
that A a 7^ A p . Since H a = h = H p and p is optimal, it follows that A p < A a . 
By definition p and a agree away from the columns of B from Cj to Dk- Hence 
the values of p on the tiles of one of the columns C from Cj to Dk determines a 
weight vector in WV for some h! > h whose area is less than the weight vector in 
Wh gotten from the values of u on the tiles of C. If C is between C^ n and D^ n , 
then the weight vector determined by a is Wh- Corollary 13.21 shows that this is 
impossible because the area of Wh is minimal. Thus C is either one of the columns 
Cj, . . . , C?, n or Dk, . . . , D^ n . By symmetry we may assume that C = Cj for some 
i £ {j, . . . , 3n}. Then A(jji) < A(x,). By construction we have that A(xj) < A(yj). 
Since we replaced the original value of Xj by Xj, we have that A(xj) < A(yj), and 
so i > j. Now we have a contradiction to the choice of j. This proves that A„ = A p . 

Now we have that H a = H p and A„ — A p . It follows that a = p. Since a is 
virtually bar uniform, we see that p is virtually bar uniform. 

This proves the dumbbell theorem. 

7. Notes 

In this section we state without proof a number of results related to the dumbbell 
theorem. 

We say that a dumbbell D is vertically convex if whenever x and y are points 
in D such that the line segment joining them is vertical, then this line segment 
lies in D. We say that the sides of D are extreme if the sides of D are contained 
in vertical lines such that D lies between these lines. Suppose that the vertical 
lines determined by our tiling of the plane intersect the rr-axis in exactly the set 
of integers. We say that a skinny cut for D is monotonic if it has an underlying 
path which is the graph of a function and that the restriction of this function to 
the interval determined by any pair of consecutive integers is either monotonically 
increasing or decreasing. If the dumbbell D is vertically convex and its sides are 
extreme, then every optimal weight function for D is a sum of monotonic minimal 
skinny cuts. 

We recall Lemma 14.21 Let R be a rectangle tiled by n rows and m columns 
of squares Tij. Let x = (xn, . . . , x n \) be a weight vector in R". Then there 
exists a unique weight function p for R with minimal area subject to the condi- 
tions that p is a sum of strictly monotonic skinny cuts and p{Tn) — Xi for ev- 
ery i £ {1, . . . ,n}. The skinny cut function <S> is defined so that if m = 2, then 
<P(x) = (p(Ti 2 ), . . . ,p(T n2 )). It is in fact true for every m > 2 that ^^(x) = 
(p(Tij-), . . . , p(T nj )) for every j G {2, . . . , m}. 

Let h be a positive real number, and let x S Wh- Let n(x) = (po, ■ ■ ■ ,p n )- We 
define a nonnegative integer pi for every i G {1, . . . , n — 1}. Let i € {1, . . . , n — 1}. 
If Pi < —, then pi is the number of partition points Pi, . . . ,p n -i in the half-closed 
half-open interval [pi, — )• If Pi > — , then pi is the number of partition points 
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pi,..., Pi in the half-open half-closed interval (^,pi\. If pi — then fii = 0. 
Set /i = max{/ii, . . . , /i„^_i}. Note that fi < n — 1, and that /z = n — 1 for the 
weight vector (0, 0, 0, . . . , h). It turns out that <P m (x) = Wh for some nonnegative 
integer m if and only if m > \x. This implies that <& n (x) — u>h for every weight 
vector x € Wh- In other words, if %i, . . . , x n -\ are weight vectors in Wh such that 
Xi+i = for every i S {1, ...,n— 2}, then x„_i = Wh- Compare this with 

Theorem 15. 71 We see that the integer 3n in Theorem 15 . 71 cannot be replaced by an 
integer less than n — 1. It is in fact true that there exists a positive real number e 
such that the integer 3n in Theorem 15 . 71 cannot be replaced by an integer less than 
(1 + e)n. It is probably true that 3n can be replaced by 3n/2. 

Not surprisingly, the skinny cut function is continuous. What might be sur- 
prising is that it is in fact piecewise affine. If c„ denotes the number of affine pieces 
of <P, then the sequence (c„) satisfies a quadratic linear recursion with eigenvalue 
1 + V2. 

There exists a theory of combinatorial moduli "with boundary conditions" , as 
alluded to by Lemma 14.21 For simplicity we deal with a rectangle R with n rows 
of tiles. Let ft be a positive real number, and let i be a weight vector in R ra with 
height h. Here are two ways to define optimal weight functions with boundary 
conditions. In the first way we minimize area over all weight functions on R with 
fat flow height h such that the values of p on the first column of R give x. In 
the second way we maximize the fat flow modulus over all weight functions on R 
whose values on the first column of R give a scalar multiple of x. It turns out that 
these constructions arc equivalent. They yield a weight function which is unique 
up to scaling. Interpreted properly, most of the results of [1 hold in this setting. 
For example, the optimal weight function is a sum of minimal fat flows (modulo 
the first column). A straightforward modification of the algorithm for computing 
optimal weight functions even applies in this setting. 
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